Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 431.290 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 83.9 MiB |
| Average record size in memory | 204.0 B |
Variable types
| Categorical | 19 |
|---|---|
| Numeric | 9 |
year has constant value "2015" | Constant |
dishwasher has constant value "NA" | Constant |
country has a high cardinality: 60 distinct values | High cardinality |
wealth has a high cardinality: 47482 distinct values | High cardinality |
escs has a high cardinality: 52733 distinct values | High cardinality |
country_2 has a high cardinality: 60 distinct values | High cardinality |
country_name has a high cardinality: 60 distinct values | High cardinality |
country_3 has a high cardinality: 60 distinct values | High cardinality |
school_id is highly correlated with country and 7 other fields | High correlation |
student_id is highly correlated with country and 7 other fields | High correlation |
math is highly correlated with read and 1 other fields | High correlation |
read is highly correlated with math and 1 other fields | High correlation |
science is highly correlated with math and 1 other fields | High correlation |
stu_wgt is highly correlated with country and 4 other fields | High correlation |
rank is highly correlated with country and 9 other fields | High correlation |
finalIq is highly correlated with country and 9 other fields | High correlation |
pop2021 is highly correlated with country and 8 other fields | High correlation |
year is highly correlated with desk and 15 other fields | High correlation |
country is highly correlated with school_id and 18 other fields | High correlation |
mother_educ is highly correlated with country and 8 other fields | High correlation |
father_educ is highly correlated with country and 8 other fields | High correlation |
gender is highly correlated with year and 1 other fields | High correlation |
computer is highly correlated with country and 12 other fields | High correlation |
internet is highly correlated with country and 12 other fields | High correlation |
desk is highly correlated with country and 12 other fields | High correlation |
room is highly correlated with country and 11 other fields | High correlation |
dishwasher is highly correlated with desk and 15 other fields | High correlation |
television is highly correlated with country and 10 other fields | High correlation |
computer_n is highly correlated with country and 12 other fields | High correlation |
car is highly correlated with country and 10 other fields | High correlation |
book is highly correlated with country and 9 other fields | High correlation |
country_2 is highly correlated with country and 18 other fields | High correlation |
country_name is highly correlated with country and 18 other fields | High correlation |
country_3 is highly correlated with country and 18 other fields | High correlation |
student_id has unique values | Unique |
Reproduction
| Analysis started | 2022-10-28 19:32:53.013132 |
|---|---|
| Analysis finished | 2022-10-28 19:37:45.071012 |
| Duration | 4 minutes and 52.06 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 2015 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1.725.160 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2015 | 431290 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2015 | 431290 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 431290 | |
| 0 | 431290 | |
| 1 | 431290 | |
| 5 | 431290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1725160 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 431290 | |
| 0 | 431290 | |
| 1 | 431290 | |
| 5 | 431290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1725160 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 431290 | |
| 0 | 431290 | |
| 1 | 431290 | |
| 5 | 431290 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1725160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 431290 | |
| 0 | 431290 | |
| 1 | 431290 | |
| 5 | 431290 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| BRA | 23141 |
|---|---|
| CAN | 20058 |
| AUS | 14530 |
| ARE | 14167 |
| GBR | 14157 |
| Other values (55) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1.293.870 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ISR |
|---|---|
| 2nd row | ISR |
| 3rd row | ISR |
| 4th row | ISR |
| 5th row | ISR |
Common Values
| Value | Count | Frequency (%) |
| BRA | 23141 | 5.4% |
| CAN | 20058 | 4.7% |
| AUS | 14530 | 3.4% |
| ARE | 14167 | 3.3% |
| GBR | 14157 | 3.3% |
| QAT | 12083 | 2.8% |
| COL | 11795 | 2.7% |
| ITA | 11583 | 2.7% |
| BEL | 9651 | 2.2% |
| THA | 8249 | 1.9% |
| Other values (50) | 291876 |
Length
| Value | Count | Frequency (%) |
| bra | 23141 | 5.4% |
| can | 20058 | 4.7% |
| aus | 14530 | 3.4% |
| are | 14167 | 3.3% |
| gbr | 14157 | 3.3% |
| qat | 12083 | 2.8% |
| col | 11795 | 2.7% |
| ita | 11583 | 2.7% |
| bel | 9651 | 2.2% |
| tha | 8249 | 1.9% |
| Other values (50) | 291876 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1293870 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1293870 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1293870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
| Distinct | 15386 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41570919.19 |
| Minimum | 800001 |
|---|---|
| Maximum | 85800222 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 800001 |
|---|---|
| 5-th percentile | 3600578 |
| Q1 | 17000312 |
| median | 40000005 |
| Q3 | 64300023 |
| 95-th percentile | 82600178 |
| Maximum | 85800222 |
| Range | 85000221 |
| Interquartile range (IQR) | 47299711 |
Descriptive statistics
| Standard deviation | 26260221.53 |
|---|---|
| Coefficient of variation (CV) | 0.631696918 |
| Kurtosis | -1.316732116 |
| Mean | 41570919.19 |
| Median Absolute Deviation (MAD) | 23400100 |
| Skewness | 0.08255531682 |
| Sum | 1.792912174 × 1013 |
| Variance | 6.895992347 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 63400102 | 635 | 0.1% |
| 44200001 | 293 | 0.1% |
| 44200016 | 258 | 0.1% |
| 44200024 | 256 | 0.1% |
| 63400051 | 246 | 0.1% |
| 63400012 | 244 | 0.1% |
| 44200038 | 231 | 0.1% |
| 63400152 | 226 | 0.1% |
| 63400076 | 226 | 0.1% |
| 49900036 | 226 | 0.1% |
| Other values (15376) | 428449 |
| Value | Count | Frequency (%) |
| 800001 | 32 | |
| 800002 | 2 | < 0.1% |
| 800003 | 6 | < 0.1% |
| 800004 | 29 | |
| 800005 | 33 | |
| 800006 | 4 | < 0.1% |
| 800007 | 25 | |
| 800008 | 11 | < 0.1% |
| 800009 | 9 | < 0.1% |
| 800010 | 33 |
| Value | Count | Frequency (%) |
| 85800222 | 37 | |
| 85800221 | 32 | |
| 85800220 | 24 | |
| 85800219 | 18 | |
| 85800218 | 18 | |
| 85800217 | 34 | |
| 85800216 | 36 | |
| 85800215 | 36 | |
| 85800214 | 33 | |
| 85800213 | 21 |
| Distinct | 431290 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41576510.58 |
| Minimum | 800001 |
|---|---|
| Maximum | 85807641 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 800001 |
|---|---|
| 5-th percentile | 3615411.9 |
| Q1 | 17011121.25 |
| median | 40000143.5 |
| Q3 | 64300611.75 |
| 95-th percentile | 82605243.55 |
| Maximum | 85807641 |
| Range | 85007640 |
| Interquartile range (IQR) | 47289490.5 |
Descriptive statistics
| Standard deviation | 26259080.54 |
|---|---|
| Coefficient of variation (CV) | 0.6315845215 |
| Kurtosis | -1.316711293 |
| Mean | 41576510.58 |
| Median Absolute Deviation (MAD) | 23408606 |
| Skewness | 0.08267475679 |
| Sum | 1.793153325 × 1013 |
| Variance | 6.895393109 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37603048 | 1 | < 0.1% |
| 5603017 | 1 | < 0.1% |
| 5603893 | 1 | < 0.1% |
| 5603441 | 1 | < 0.1% |
| 5600699 | 1 | < 0.1% |
| 5600100 | 1 | < 0.1% |
| 5605890 | 1 | < 0.1% |
| 5601592 | 1 | < 0.1% |
| 5604675 | 1 | < 0.1% |
| 5606346 | 1 | < 0.1% |
| Other values (431280) | 431280 |
| Value | Count | Frequency (%) |
| 800001 | 1 | |
| 800002 | 1 | |
| 800003 | 1 | |
| 800004 | 1 | |
| 800005 | 1 | |
| 800006 | 1 | |
| 800007 | 1 | |
| 800008 | 1 | |
| 800009 | 1 | |
| 800010 | 1 |
| Value | Count | Frequency (%) |
| 85807641 | 1 | |
| 85807640 | 1 | |
| 85807639 | 1 | |
| 85807638 | 1 | |
| 85807637 | 1 | |
| 85807635 | 1 | |
| 85807634 | 1 | |
| 85807633 | 1 | |
| 85807632 | 1 | |
| 85807631 | 1 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| ISCED 3A | |
|---|---|
| ISCED 3B, C | |
| ISCED 2 | |
| ISCED 1 | |
| NA |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.233541237 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3.551.044 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ISCED 3A |
|---|---|
| 2nd row | ISCED 3A |
| 3rd row | NA |
| 4th row | ISCED 3A |
| 5th row | ISCED 3B, C |
Common Values
| Value | Count | Frequency (%) |
| ISCED 3A | 217087 | |
| ISCED 3B, C | 76868 | 17.8% |
| ISCED 2 | 70227 | 16.3% |
| ISCED 1 | 27845 | 6.5% |
| NA | 24708 | 5.7% |
| less than ISCED1 | 14555 | 3.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| isced | 392027 | |
| 3a | 217087 | |
| 3b | 76868 | 8.3% |
| c | 76868 | 8.3% |
| 2 | 70227 | 7.6% |
| 1 | 27845 | 3.0% |
| na | 24708 | 2.7% |
| less | 14555 | 1.6% |
| than | 14555 | 1.6% |
| isced1 | 14555 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 498005 | ||
| C | 483450 | |
| I | 406582 | |
| S | 406582 | |
| E | 406582 | |
| D | 406582 | |
| 3 | 293955 | |
| A | 241795 | |
| B | 76868 | 2.2% |
| , | 76868 | 2.2% |
| Other values (10) | 253775 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2453149 | |
| Space Separator | 498005 | 14.0% |
| Decimal Number | 406582 | 11.4% |
| Lowercase Letter | 116440 | 3.3% |
| Other Punctuation | 76868 | 2.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 483450 | |
| I | 406582 | |
| S | 406582 | |
| E | 406582 | |
| D | 406582 | |
| A | 241795 | |
| B | 76868 | 3.1% |
| N | 24708 | 1.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 29110 | |
| l | 14555 | |
| e | 14555 | |
| t | 14555 | |
| h | 14555 | |
| a | 14555 | |
| n | 14555 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 293955 | |
| 2 | 70227 | 17.3% |
| 1 | 42400 | 10.4% |
Space Separator
| Value | Count | Frequency (%) |
| 498005 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 76868 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2569589 | |
| Common | 981455 | 27.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 483450 | |
| I | 406582 | |
| S | 406582 | |
| E | 406582 | |
| D | 406582 | |
| A | 241795 | |
| B | 76868 | 3.0% |
| s | 29110 | 1.1% |
| N | 24708 | 1.0% |
| l | 14555 | 0.6% |
| Other values (5) | 72775 | 2.8% |
Common
| Value | Count | Frequency (%) |
| 498005 | ||
| 3 | 293955 | |
| , | 76868 | 7.8% |
| 2 | 70227 | 7.2% |
| 1 | 42400 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3551044 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 498005 | ||
| C | 483450 | |
| I | 406582 | |
| S | 406582 | |
| E | 406582 | |
| D | 406582 | |
| 3 | 293955 | |
| A | 241795 | |
| B | 76868 | 2.2% |
| , | 76868 | 2.2% |
| Other values (10) | 253775 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| ISCED 3A | |
|---|---|
| ISCED 3B, C | |
| ISCED 2 | |
| NA | |
| ISCED 1 |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 8.193187878 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3.533.640 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ISCED 3A |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | ISCED 3B, C |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| ISCED 3A | 197530 | |
| ISCED 3B, C | 87009 | |
| ISCED 2 | 71479 | 16.6% |
| NA | 32041 | 7.4% |
| ISCED 1 | 28870 | 6.7% |
| less than ISCED1 | 14361 | 3.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| isced | 384888 | |
| 3a | 197530 | |
| 3b | 87009 | 9.3% |
| c | 87009 | 9.3% |
| 2 | 71479 | 7.7% |
| na | 32041 | 3.4% |
| 1 | 28870 | 3.1% |
| less | 14361 | 1.5% |
| than | 14361 | 1.5% |
| isced1 | 14361 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 500619 | ||
| C | 486258 | |
| I | 399249 | |
| S | 399249 | |
| E | 399249 | |
| D | 399249 | |
| 3 | 284539 | |
| A | 229571 | |
| B | 87009 | 2.5% |
| , | 87009 | 2.5% |
| Other values (10) | 261639 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2431875 | |
| Space Separator | 500619 | 14.2% |
| Decimal Number | 399249 | 11.3% |
| Lowercase Letter | 114888 | 3.3% |
| Other Punctuation | 87009 | 2.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 486258 | |
| I | 399249 | |
| S | 399249 | |
| E | 399249 | |
| D | 399249 | |
| A | 229571 | |
| B | 87009 | 3.6% |
| N | 32041 | 1.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 28722 | |
| l | 14361 | |
| e | 14361 | |
| t | 14361 | |
| h | 14361 | |
| a | 14361 | |
| n | 14361 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 284539 | |
| 2 | 71479 | 17.9% |
| 1 | 43231 | 10.8% |
Space Separator
| Value | Count | Frequency (%) |
| 500619 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 87009 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2546763 | |
| Common | 986877 | 27.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 486258 | |
| I | 399249 | |
| S | 399249 | |
| E | 399249 | |
| D | 399249 | |
| A | 229571 | |
| B | 87009 | 3.4% |
| N | 32041 | 1.3% |
| s | 28722 | 1.1% |
| l | 14361 | 0.6% |
| Other values (5) | 71805 | 2.8% |
Common
| Value | Count | Frequency (%) |
| 500619 | ||
| 3 | 284539 | |
| , | 87009 | 8.8% |
| 2 | 71479 | 7.2% |
| 1 | 43231 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3533640 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 500619 | ||
| C | 486258 | |
| I | 399249 | |
| S | 399249 | |
| E | 399249 | |
| D | 399249 | |
| 3 | 284539 | |
| A | 229571 | |
| B | 87009 | 2.5% |
| , | 87009 | 2.5% |
| Other values (10) | 261639 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| female | |
|---|---|
| male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.005717731 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2.158.916 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | female |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | female |
Common Values
| Value | Count | Frequency (%) |
| female | 216878 | |
| male | 214412 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| female | 216878 | |
| male | 214412 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 648168 | |
| m | 431290 | |
| a | 431290 | |
| l | 431290 | |
| f | 216878 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2158916 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 648168 | |
| m | 431290 | |
| a | 431290 | |
| l | 431290 | |
| f | 216878 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2158916 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 648168 | |
| m | 431290 | |
| a | 431290 | |
| l | 431290 | |
| f | 216878 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2158916 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 648168 | |
| m | 431290 | |
| a | 431290 | |
| l | 431290 | |
| f | 216878 | 10.0% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| yes | |
|---|---|
| no | |
| NA | 19268 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.80993531 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1.211.897 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| yes | 349317 | |
| no | 62705 | 14.5% |
| NA | 19268 | 4.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 349317 | |
| no | 62705 | 14.5% |
| na | 19268 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 349317 | |
| e | 349317 | |
| s | 349317 | |
| n | 62705 | 5.2% |
| o | 62705 | 5.2% |
| N | 19268 | 1.6% |
| A | 19268 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1173361 | |
| Uppercase Letter | 38536 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 349317 | |
| e | 349317 | |
| s | 349317 | |
| n | 62705 | 5.3% |
| o | 62705 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19268 | |
| A | 19268 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1211897 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 349317 | |
| e | 349317 | |
| s | 349317 | |
| n | 62705 | 5.2% |
| o | 62705 | 5.2% |
| N | 19268 | 1.6% |
| A | 19268 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1211897 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 349317 | |
| e | 349317 | |
| s | 349317 | |
| n | 62705 | 5.2% |
| o | 62705 | 5.2% |
| N | 19268 | 1.6% |
| A | 19268 | 1.6% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| yes | |
|---|---|
| no | |
| NA | 19327 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.847241995 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1.227.987 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| yes | 365407 | |
| no | 46556 | 10.8% |
| NA | 19327 | 4.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 365407 | |
| no | 46556 | 10.8% |
| na | 19327 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 365407 | |
| e | 365407 | |
| s | 365407 | |
| n | 46556 | 3.8% |
| o | 46556 | 3.8% |
| N | 19327 | 1.6% |
| A | 19327 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1189333 | |
| Uppercase Letter | 38654 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 365407 | |
| e | 365407 | |
| s | 365407 | |
| n | 46556 | 3.9% |
| o | 46556 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19327 | |
| A | 19327 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1227987 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 365407 | |
| e | 365407 | |
| s | 365407 | |
| n | 46556 | 3.8% |
| o | 46556 | 3.8% |
| N | 19327 | 1.6% |
| A | 19327 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1227987 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 365407 | |
| e | 365407 | |
| s | 365407 | |
| n | 46556 | 3.8% |
| o | 46556 | 3.8% |
| N | 19327 | 1.6% |
| A | 19327 | 1.6% |
| Distinct | 224921 |
|---|---|
| Distinct (%) | 52.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 458.052289 |
| Minimum | 0 |
|---|---|
| Maximum | 860.903 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 291.717 |
| Q1 | 385.14425 |
| median | 457.938 |
| Q3 | 530.844 |
| 95-th percentile | 624.57065 |
| Maximum | 860.903 |
| Range | 860.903 |
| Interquartile range (IQR) | 145.69975 |
Descriptive statistics
| Standard deviation | 102.006902 |
|---|---|
| Coefficient of variation (CV) | 0.2226970686 |
| Kurtosis | -0.3197276195 |
| Mean | 458.052289 |
| Median Absolute Deviation (MAD) | 72.85 |
| Skewness | -0.00141234106 |
| Sum | 197553371.7 |
| Variance | 10405.40806 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 482.817 | 11 | < 0.1% |
| 361.954 | 10 | < 0.1% |
| 392.042 | 10 | < 0.1% |
| 368.967 | 10 | < 0.1% |
| 437.911 | 10 | < 0.1% |
| 384.647 | 10 | < 0.1% |
| 468.493 | 10 | < 0.1% |
| 441.804 | 10 | < 0.1% |
| 398.712 | 9 | < 0.1% |
| 503.436 | 9 | < 0.1% |
| Other values (224911) | 431191 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 8.473 | 1 | |
| 28.435 | 1 | |
| 29.428 | 1 | |
| 31.604 | 1 | |
| 39.283 | 1 | |
| 41.607 | 1 | |
| 52.377 | 1 | |
| 52.627 | 1 | |
| 54.473 | 1 |
| Value | Count | Frequency (%) |
| 860.903 | 1 | |
| 852.526 | 1 | |
| 847.23 | 1 | |
| 846.55 | 1 | |
| 842.615 | 1 | |
| 842.477 | 1 | |
| 841.922 | 1 | |
| 834.321 | 1 | |
| 830.881 | 1 | |
| 829.383 | 1 |
| Distinct | 232996 |
|---|---|
| Distinct (%) | 54.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 462.8403368 |
| Minimum | 0 |
|---|---|
| Maximum | 861.854 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 284.94125 |
| Q1 | 389.542 |
| median | 466.2455 |
| Q3 | 539.10575 |
| 95-th percentile | 630.22455 |
| Maximum | 861.854 |
| Range | 861.854 |
| Interquartile range (IQR) | 149.56375 |
Descriptive statistics
| Standard deviation | 105.7651463 |
|---|---|
| Coefficient of variation (CV) | 0.2285132429 |
| Kurtosis | -0.2557301969 |
| Mean | 462.8403368 |
| Median Absolute Deviation (MAD) | 74.6505 |
| Skewness | -0.1479724736 |
| Sum | 199618408.8 |
| Variance | 11186.26617 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 422.693 | 10 | < 0.1% |
| 361.062 | 10 | < 0.1% |
| 419.162 | 9 | < 0.1% |
| 452.839 | 9 | < 0.1% |
| 475.697 | 9 | < 0.1% |
| 415.723 | 9 | < 0.1% |
| 507.49 | 9 | < 0.1% |
| 522.363 | 9 | < 0.1% |
| 527.936 | 9 | < 0.1% |
| 423.564 | 9 | < 0.1% |
| Other values (232986) | 431198 |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 5.384 | 1 | < 0.1% |
| 9.294 | 1 | < 0.1% |
| 16.466 | 1 | < 0.1% |
| 26.806 | 1 | < 0.1% |
| 34.198 | 1 | < 0.1% |
| 36.158 | 1 | < 0.1% |
| 40.692 | 1 | < 0.1% |
| 41.899 | 1 | < 0.1% |
| 43.92 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 861.854 | 1 | |
| 851.085 | 1 | |
| 850.75 | 1 | |
| 846.678 | 1 | |
| 844.291 | 1 | |
| 833.919 | 1 | |
| 832.034 | 1 | |
| 829.295 | 1 | |
| 827.036 | 1 | |
| 827.026 | 1 |
| Distinct | 199531 |
|---|---|
| Distinct (%) | 46.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 465.5219762 |
| Minimum | 25.103 |
|---|---|
| Maximum | 888.359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 25.103 |
|---|---|
| 5-th percentile | 303.9999 |
| Q1 | 390.4915 |
| median | 462.747 |
| Q3 | 538.477 |
| 95-th percentile | 635.587 |
| Maximum | 888.359 |
| Range | 863.256 |
| Interquartile range (IQR) | 147.9855 |
Descriptive statistics
| Standard deviation | 102.0113656 |
|---|---|
| Coefficient of variation (CV) | 0.2191332973 |
| Kurtosis | -0.3932461886 |
| Mean | 465.5219762 |
| Median Absolute Deviation (MAD) | 73.939 |
| Skewness | 0.1039951968 |
| Sum | 200774973.1 |
| Variance | 10406.31871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 467.58 | 12 | < 0.1% |
| 461.453 | 12 | < 0.1% |
| 429.74 | 11 | < 0.1% |
| 502.319 | 11 | < 0.1% |
| 435.278 | 11 | < 0.1% |
| 468.474 | 11 | < 0.1% |
| 397.446 | 10 | < 0.1% |
| 351.894 | 10 | < 0.1% |
| 437.947 | 10 | < 0.1% |
| 449.427 | 10 | < 0.1% |
| Other values (199521) | 431182 |
| Value | Count | Frequency (%) |
| 25.103 | 1 | |
| 58.748 | 1 | |
| 66.498 | 1 | |
| 75.748 | 1 | |
| 78.33 | 1 | |
| 88.16 | 1 | |
| 88.265 | 1 | |
| 88.655 | 1 | |
| 98.086 | 1 | |
| 101.954 | 1 |
| Value | Count | Frequency (%) |
| 888.359 | 1 | |
| 876.746 | 1 | |
| 871.481 | 1 | |
| 870.02 | 1 | |
| 863.713 | 1 | |
| 860.276 | 1 | |
| 859.783 | 1 | |
| 856.619 | 1 | |
| 856.247 | 1 | |
| 855.72 | 1 |
| Distinct | 33214 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.45132543 |
| Minimum | 1 |
|---|---|
| Maximum | 2160.911 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.07692 |
| Q1 | 5.83732 |
| median | 13.86692 |
| Q3 | 56.88371 |
| 95-th percentile | 200.8544 |
| Maximum | 2160.911 |
| Range | 2159.911 |
| Interquartile range (IQR) | 51.04639 |
Descriptive statistics
| Standard deviation | 109.4962558 |
|---|---|
| Coefficient of variation (CV) | 2.01090157 |
| Kurtosis | 35.09835109 |
| Mean | 54.45132543 |
| Median Absolute Deviation (MAD) | 11.68725 |
| Skewness | 4.858503871 |
| Sum | 23484312.14 |
| Variance | 11989.43003 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5558 | 1.3% |
| 32.64061 | 810 | 0.2% |
| 4.54703 | 767 | 0.2% |
| 1.02139 | 762 | 0.2% |
| 1.38571 | 562 | 0.1% |
| 1.0625 | 523 | 0.1% |
| 56.88371 | 431 | 0.1% |
| 59.80729 | 429 | 0.1% |
| 1.06667 | 420 | 0.1% |
| 5.22017 | 327 | 0.1% |
| Other values (33204) | 420701 |
| Value | Count | Frequency (%) |
| 1 | 5558 | |
| 1.00069 | 9 | < 0.1% |
| 1.00313 | 33 | < 0.1% |
| 1.00503 | 123 | < 0.1% |
| 1.00719 | 139 | < 0.1% |
| 1.00806 | 124 | < 0.1% |
| 1.01 | 100 | < 0.1% |
| 1.01031 | 194 | < 0.1% |
| 1.01047 | 191 | < 0.1% |
| 1.01087 | 48 | < 0.1% |
| Value | Count | Frequency (%) |
| 2160.911 | 12 | |
| 1968.532 | 8 | |
| 1890.579 | 10 | |
| 1766.916 | 3 | < 0.1% |
| 1743.317 | 3 | < 0.1% |
| 1717.269 | 4 | < 0.1% |
| 1603.934 | 16 | |
| 1548.798 | 5 | < 0.1% |
| 1442.697 | 2 | < 0.1% |
| 1439.287 | 13 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| yes | |
|---|---|
| no | |
| NA | 19674 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.833803241 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1.222.191 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | yes |
|---|---|
| 2nd row | yes |
| 3rd row | NA |
| 4th row | yes |
| 5th row | yes |
Common Values
| Value | Count | Frequency (%) |
| yes | 359611 | |
| no | 52005 | 12.1% |
| NA | 19674 | 4.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 359611 | |
| no | 52005 | 12.1% |
| na | 19674 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 359611 | |
| e | 359611 | |
| s | 359611 | |
| n | 52005 | 4.3% |
| o | 52005 | 4.3% |
| N | 19674 | 1.6% |
| A | 19674 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1182843 | |
| Uppercase Letter | 39348 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 359611 | |
| e | 359611 | |
| s | 359611 | |
| n | 52005 | 4.4% |
| o | 52005 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19674 | |
| A | 19674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1222191 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 359611 | |
| e | 359611 | |
| s | 359611 | |
| n | 52005 | 4.3% |
| o | 52005 | 4.3% |
| N | 19674 | 1.6% |
| A | 19674 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1222191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 359611 | |
| e | 359611 | |
| s | 359611 | |
| n | 52005 | 4.3% |
| o | 52005 | 4.3% |
| N | 19674 | 1.6% |
| A | 19674 | 1.6% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| yes | |
|---|---|
| no | |
| NA | 24184 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.769074173 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1.194.274 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| yes | 331694 | |
| no | 75412 | 17.5% |
| NA | 24184 | 5.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| yes | 331694 | |
| no | 75412 | 17.5% |
| na | 24184 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 331694 | |
| e | 331694 | |
| s | 331694 | |
| n | 75412 | 6.3% |
| o | 75412 | 6.3% |
| N | 24184 | 2.0% |
| A | 24184 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1145906 | |
| Uppercase Letter | 48368 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 331694 | |
| e | 331694 | |
| s | 331694 | |
| n | 75412 | 6.6% |
| o | 75412 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 24184 | |
| A | 24184 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1194274 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 331694 | |
| e | 331694 | |
| s | 331694 | |
| n | 75412 | 6.3% |
| o | 75412 | 6.3% |
| N | 24184 | 2.0% |
| A | 24184 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1194274 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 331694 | |
| e | 331694 | |
| s | 331694 | |
| n | 75412 | 6.3% |
| o | 75412 | 6.3% |
| N | 24184 | 2.0% |
| A | 24184 | 2.0% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| NA |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 862.580 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| NA | 431290 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| na | 431290 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 431290 | |
| A | 431290 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 862580 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 431290 | |
| A | 431290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 862580 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 431290 | |
| A | 431290 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 862580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 431290 | |
| A | 431290 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 3+ | |
|---|---|
| 2 | |
| 1 | |
| NA | |
| 0 | 6850 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.449203552 |
| Min length | 1 |
Characters and Unicode
| Total characters | 625.027 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| NA | 17668 | 4.1% |
| 0 | 6850 | 1.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| na | 17668 | 4.1% |
| 0 | 6850 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 176069 | |
| + | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| N | 17668 | 2.8% |
| A | 17668 | 2.8% |
| 0 | 6850 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 413622 | |
| Math Symbol | 176069 | |
| Uppercase Letter | 35336 | 5.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| 0 | 6850 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 17668 | |
| A | 17668 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 176069 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 589691 | |
| Latin | 35336 | 5.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 176069 | |
| + | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| 0 | 6850 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| N | 17668 | |
| A | 17668 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 625027 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 176069 | |
| + | 176069 | |
| 2 | 135142 | |
| 1 | 95561 | |
| N | 17668 | 2.8% |
| A | 17668 | 2.8% |
| 0 | 6850 | 1.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 3+ | |
|---|---|
| 2 | |
| 1 | |
| 0 | |
| NA |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.372955552 |
| Min length | 1 |
Characters and Unicode
| Total characters | 592.142 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 11.1% |
| NA | 19209 | 4.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 3 | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 11.1% |
| na | 19209 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 141643 | |
| + | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 8.1% |
| N | 19209 | 3.2% |
| A | 19209 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 412081 | |
| Math Symbol | 141643 | 23.9% |
| Uppercase Letter | 38418 | 6.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 11.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19209 | |
| A | 19209 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 141643 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 553724 | |
| Latin | 38418 | 6.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 141643 | |
| + | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 8.7% |
Latin
| Value | Count | Frequency (%) |
| N | 19209 | |
| A | 19209 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 592142 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 141643 | |
| + | 141643 | |
| 2 | 112124 | |
| 1 | 110264 | |
| 0 | 48050 | 8.1% |
| N | 19209 | 3.2% |
| A | 19209 | 3.2% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 0 | |
| 3+ | |
| NA |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.213712351 |
| Min length | 1 |
Characters and Unicode
| Total characters | 523.462 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3+ | 66877 | |
| NA | 25295 | 5.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3 | 66877 | |
| na | 25295 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3 | 66877 | |
| + | 66877 | |
| N | 25295 | 4.8% |
| A | 25295 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 405995 | |
| Math Symbol | 66877 | 12.8% |
| Uppercase Letter | 50590 | 9.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3 | 66877 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 25295 | |
| A | 25295 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 66877 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 472872 | |
| Latin | 50590 | 9.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3 | 66877 | |
| + | 66877 |
Latin
| Value | Count | Frequency (%) |
| N | 25295 | |
| A | 25295 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 523462 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 134766 | |
| 2 | 126242 | |
| 0 | 78110 | |
| 3 | 66877 | |
| + | 66877 | |
| N | 25295 | 4.8% |
| A | 25295 | 4.8% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| 26-100 | |
|---|---|
| 0-10 | |
| 11-25 | |
| 101-200 | |
| 201-500 | |
| Other values (2) |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 5.921199193 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2.553.754 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | more than 500 |
|---|---|
| 2nd row | 201-500 |
| 3rd row | more than 500 |
| 4th row | 101-200 |
| 5th row | 11-25 |
Common Values
| Value | Count | Frequency (%) |
| 26-100 | 114839 | |
| 0-10 | 85151 | |
| 11-25 | 83872 | |
| 101-200 | 60314 | |
| 201-500 | 44070 | 10.2% |
| more than 500 | 26180 | 6.1% |
| NA | 16864 | 3.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 26-100 | 114839 | |
| 0-10 | 85151 | |
| 11-25 | 83872 | |
| 101-200 | 60314 | |
| 201-500 | 44070 | 9.1% |
| more | 26180 | 5.4% |
| than | 26180 | 5.4% |
| 500 | 26180 | 5.4% |
| na | 16864 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 765492 | |
| 1 | 532432 | |
| - | 388246 | |
| 2 | 303095 | 11.9% |
| 5 | 154122 | 6.0% |
| 6 | 114839 | 4.5% |
| 52360 | 2.1% | |
| t | 26180 | 1.0% |
| n | 26180 | 1.0% |
| a | 26180 | 1.0% |
| Other values (7) | 164628 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1869980 | |
| Dash Punctuation | 388246 | 15.2% |
| Lowercase Letter | 209440 | 8.2% |
| Space Separator | 52360 | 2.1% |
| Uppercase Letter | 33728 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 26180 | |
| n | 26180 | |
| a | 26180 | |
| h | 26180 | |
| r | 26180 | |
| e | 26180 | |
| o | 26180 | |
| m | 26180 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 765492 | |
| 1 | 532432 | |
| 2 | 303095 | 16.2% |
| 5 | 154122 | 8.2% |
| 6 | 114839 | 6.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 16864 | |
| A | 16864 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 388246 |
Space Separator
| Value | Count | Frequency (%) |
| 52360 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2310586 | |
| Latin | 243168 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 26180 | |
| n | 26180 | |
| a | 26180 | |
| h | 26180 | |
| r | 26180 | |
| e | 26180 | |
| o | 26180 | |
| m | 26180 | |
| N | 16864 | |
| A | 16864 |
Common
| Value | Count | Frequency (%) |
| 0 | 765492 | |
| 1 | 532432 | |
| - | 388246 | |
| 2 | 303095 | 13.1% |
| 5 | 154122 | 6.7% |
| 6 | 114839 | 5.0% |
| 52360 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2553754 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 765492 | |
| 1 | 532432 | |
| - | 388246 | |
| 2 | 303095 | 11.9% |
| 5 | 154122 | 6.0% |
| 6 | 114839 | 4.5% |
| 52360 | 2.1% | |
| t | 26180 | 1.0% |
| n | 26180 | 1.0% |
| a | 26180 | 1.0% |
| Other values (7) | 164628 | 6.4% |
| Distinct | 47482 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| NA | 14890 |
|---|---|
| 2.3517 | 731 |
| 2.3725 | 626 |
| 4.2559 | 471 |
| 4.1549 | 439 |
| Other values (47477) |
Length
| Max length | 21 |
|---|---|
| Median length | 7 |
| Mean length | 6.345829952 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2.736.893 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7129 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | NA |
|---|---|
| 2nd row | NA |
| 3rd row | NA |
| 4th row | NA |
| 5th row | NA |
Common Values
| Value | Count | Frequency (%) |
| NA | 14890 | 3.5% |
| 2.3517 | 731 | 0.2% |
| 2.3725 | 626 | 0.1% |
| 4.2559 | 471 | 0.1% |
| 4.1549 | 439 | 0.1% |
| -3.7458 | 385 | 0.1% |
| -3.2413 | 317 | 0.1% |
| 2.752 | 301 | 0.1% |
| 2.7852 | 265 | 0.1% |
| 3.3323 | 237 | 0.1% |
| Other values (47472) | 412628 |
Length
| Value | Count | Frequency (%) |
| na | 14890 | 3.5% |
| 2.3517 | 732 | 0.2% |
| 2.3725 | 626 | 0.1% |
| 4.2559 | 471 | 0.1% |
| 4.1549 | 439 | 0.1% |
| 3.7458 | 385 | 0.1% |
| 3.2413 | 317 | 0.1% |
| 2.752 | 301 | 0.1% |
| 2.7852 | 268 | 0.1% |
| 3.3323 | 237 | 0.1% |
| Other values (29640) | 412624 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 416066 | |
| 0 | 398810 | |
| 1 | 286211 | |
| - | 247557 | |
| 2 | 214134 | |
| 3 | 184585 | |
| 4 | 170943 | |
| 5 | 165398 | 6.0% |
| 6 | 159873 | 5.8% |
| 7 | 159172 | 5.8% |
| Other values (5) | 334144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2043195 | |
| Other Punctuation | 416066 | 15.2% |
| Dash Punctuation | 247557 | 9.0% |
| Uppercase Letter | 29780 | 1.1% |
| Lowercase Letter | 295 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 398810 | |
| 1 | 286211 | |
| 2 | 214134 | |
| 3 | 184585 | |
| 4 | 170943 | |
| 5 | 165398 | |
| 6 | 159873 | |
| 7 | 159172 | 7.8% |
| 8 | 153398 | 7.5% |
| 9 | 150671 | 7.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 14890 | |
| A | 14890 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 416066 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 247557 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 295 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2706818 | |
| Latin | 30075 | 1.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 416066 | |
| 0 | 398810 | |
| 1 | 286211 | |
| - | 247557 | |
| 2 | 214134 | |
| 3 | 184585 | |
| 4 | 170943 | |
| 5 | 165398 | 6.1% |
| 6 | 159873 | 5.9% |
| 7 | 159172 | 5.9% |
| Other values (2) | 304069 |
Latin
| Value | Count | Frequency (%) |
| N | 14890 | |
| A | 14890 | |
| e | 295 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2736893 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 416066 | |
| 0 | 398810 | |
| 1 | 286211 | |
| - | 247557 | |
| 2 | 214134 | |
| 3 | 184585 | |
| 4 | 170943 | |
| 5 | 165398 | 6.0% |
| 6 | 159873 | 5.8% |
| 7 | 159172 | 5.8% |
| Other values (5) | 334144 |
| Distinct | 52733 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| NA | 14944 |
|---|---|
| 0.8731 | 30 |
| 0.4149 | 28 |
| -0.0998 | 27 |
| 0.5469 | 27 |
| Other values (52728) |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 6.295413759 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2.715.149 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7650 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 0.30620000000000003 |
|---|---|
| 2nd row | 1.478 |
| 3rd row | NA |
| 4th row | 0.2243 |
| 5th row | 0.1467 |
Common Values
| Value | Count | Frequency (%) |
| NA | 14944 | 3.5% |
| 0.8731 | 30 | < 0.1% |
| 0.4149 | 28 | < 0.1% |
| -0.0998 | 27 | < 0.1% |
| 0.5469 | 27 | < 0.1% |
| 0.7977 | 27 | < 0.1% |
| 0.7139 | 26 | < 0.1% |
| 0.9498 | 26 | < 0.1% |
| 0.449 | 26 | < 0.1% |
| 0.3348 | 26 | < 0.1% |
| Other values (52723) | 416103 |
Length
| Value | Count | Frequency (%) |
| na | 14944 | 3.5% |
| 0.449 | 48 | < 0.1% |
| 0.9498 | 46 | < 0.1% |
| 0.7838 | 45 | < 0.1% |
| 0.7139 | 45 | < 0.1% |
| 0.5063 | 44 | < 0.1% |
| 0.1493 | 44 | < 0.1% |
| 0.992 | 44 | < 0.1% |
| 0.4149 | 44 | < 0.1% |
| 0.625 | 43 | < 0.1% |
| Other values (32866) | 415943 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 416085 | |
| 0 | 400310 | |
| 1 | 304132 | |
| - | 226213 | |
| 2 | 199685 | |
| 3 | 174738 | |
| 4 | 167007 | |
| 5 | 163819 | 6.0% |
| 6 | 161067 | 5.9% |
| 7 | 160069 | 5.9% |
| Other values (5) | 342024 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2042737 | |
| Other Punctuation | 416085 | 15.3% |
| Dash Punctuation | 226213 | 8.3% |
| Uppercase Letter | 29888 | 1.1% |
| Lowercase Letter | 226 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 400310 | |
| 1 | 304132 | |
| 2 | 199685 | |
| 3 | 174738 | |
| 4 | 167007 | |
| 5 | 163819 | |
| 6 | 161067 | |
| 7 | 160069 | 7.8% |
| 8 | 157114 | 7.7% |
| 9 | 154796 | 7.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 14944 | |
| A | 14944 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 416085 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 226213 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2685035 | |
| Latin | 30114 | 1.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 416085 | |
| 0 | 400310 | |
| 1 | 304132 | |
| - | 226213 | |
| 2 | 199685 | |
| 3 | 174738 | |
| 4 | 167007 | |
| 5 | 163819 | 6.1% |
| 6 | 161067 | 6.0% |
| 7 | 160069 | 6.0% |
| Other values (2) | 311910 |
Latin
| Value | Count | Frequency (%) |
| N | 14944 | |
| A | 14944 | |
| e | 226 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2715149 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 416085 | |
| 0 | 400310 | |
| 1 | 304132 | |
| - | 226213 | |
| 2 | 199685 | |
| 3 | 174738 | |
| 4 | 167007 | |
| 5 | 163819 | 6.0% |
| 6 | 161067 | 5.9% |
| 7 | 160069 | 5.9% |
| Other values (5) | 342024 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| BRA | 23141 |
|---|---|
| CAN | 20058 |
| AUS | 14530 |
| ARE | 14167 |
| GBR | 14157 |
| Other values (55) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1.293.870 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ISR |
|---|---|
| 2nd row | ISR |
| 3rd row | ISR |
| 4th row | ISR |
| 5th row | ISR |
Common Values
| Value | Count | Frequency (%) |
| BRA | 23141 | 5.4% |
| CAN | 20058 | 4.7% |
| AUS | 14530 | 3.4% |
| ARE | 14167 | 3.3% |
| GBR | 14157 | 3.3% |
| QAT | 12083 | 2.8% |
| COL | 11795 | 2.7% |
| ITA | 11583 | 2.7% |
| BEL | 9651 | 2.2% |
| THA | 8249 | 1.9% |
| Other values (50) | 291876 |
Length
| Value | Count | Frequency (%) |
| bra | 23141 | 5.4% |
| can | 20058 | 4.7% |
| aus | 14530 | 3.4% |
| are | 14167 | 3.3% |
| gbr | 14157 | 3.3% |
| qat | 12083 | 2.8% |
| col | 11795 | 2.7% |
| ita | 11583 | 2.7% |
| bel | 9651 | 2.2% |
| tha | 8249 | 1.9% |
| Other values (50) | 291876 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1293870 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1293870 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1293870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 149516 | 11.6% |
| A | 143566 | 11.1% |
| N | 95098 | 7.3% |
| L | 82082 | 6.3% |
| E | 79483 | 6.1% |
| U | 79479 | 6.1% |
| T | 73263 | 5.7% |
| S | 72899 | 5.6% |
| B | 62638 | 4.8% |
| C | 57164 | 4.4% |
| Other values (16) | 398682 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| Brazil | 23141 |
|---|---|
| Canada | 20058 |
| Australia | 14530 |
| United Arab Emirates | 14167 |
| United Kingdom | 14157 |
| Other values (55) |
Length
| Max length | 20 |
|---|---|
| Median length | 18 |
| Mean length | 7.920786478 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3.416.156 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Israel |
|---|---|
| 2nd row | Israel |
| 3rd row | Israel |
| 4th row | Israel |
| 5th row | Israel |
Common Values
| Value | Count | Frequency (%) |
| Brazil | 23141 | 5.4% |
| Canada | 20058 | 4.7% |
| Australia | 14530 | 3.4% |
| United Arab Emirates | 14167 | 3.3% |
| United Kingdom | 14157 | 3.3% |
| Qatar | 12083 | 2.8% |
| Colombia | 11795 | 2.7% |
| Italy | 11583 | 2.7% |
| Belgium | 9651 | 2.2% |
| Thailand | 8249 | 1.9% |
| Other values (50) | 291876 |
Length
| Value | Count | Frequency (%) |
| united | 34036 | 6.8% |
| brazil | 23141 | 4.6% |
| canada | 20058 | 4.0% |
| australia | 14530 | 2.9% |
| arab | 14167 | 2.8% |
| emirates | 14167 | 2.8% |
| kingdom | 14157 | 2.8% |
| qatar | 12083 | 2.4% |
| colombia | 11795 | 2.4% |
| italy | 11583 | 2.3% |
| Other values (56) | 331483 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.6% |
| n | 258011 | 7.6% |
| e | 246909 | 7.2% |
| r | 223850 | 6.6% |
| l | 178279 | 5.2% |
| t | 173762 | 5.1% |
| o | 171406 | 5.0% |
| d | 136300 | 4.0% |
| u | 113944 | 3.3% |
| Other values (36) | 1110187 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2845046 | |
| Uppercase Letter | 501200 | 14.7% |
| Space Separator | 69910 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | |
| n | 258011 | |
| e | 246909 | |
| r | 223850 | |
| l | 178279 | 6.3% |
| t | 173762 | 6.1% |
| o | 171406 | 6.0% |
| d | 136300 | 4.8% |
| u | 113944 | 4.0% |
| Other values (13) | 539077 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 51581 | 10.3% |
| S | 48218 | 9.6% |
| A | 46438 | 9.3% |
| U | 40098 | 8.0% |
| B | 38720 | 7.7% |
| I | 33806 | 6.7% |
| R | 22518 | 4.5% |
| M | 22192 | 4.4% |
| L | 21239 | 4.2% |
| E | 19754 | 3.9% |
| Other values (12) | 156636 |
Space Separator
| Value | Count | Frequency (%) |
| 69910 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3346246 | |
| Common | 69910 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.8% |
| n | 258011 | 7.7% |
| e | 246909 | 7.4% |
| r | 223850 | 6.7% |
| l | 178279 | 5.3% |
| t | 173762 | 5.2% |
| o | 171406 | 5.1% |
| d | 136300 | 4.1% |
| u | 113944 | 3.4% |
| Other values (35) | 1040277 |
Common
| Value | Count | Frequency (%) |
| 69910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3416156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.6% |
| n | 258011 | 7.6% |
| e | 246909 | 7.2% |
| r | 223850 | 6.6% |
| l | 178279 | 5.2% |
| t | 173762 | 5.1% |
| o | 171406 | 5.0% |
| d | 136300 | 4.0% |
| u | 113944 | 3.3% |
| Other values (36) | 1110187 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.75911799 |
| Minimum | 1 |
|---|---|
| Maximum | 127 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 19 |
| median | 40 |
| Q3 | 75 |
| 95-th percentile | 116 |
| Maximum | 127 |
| Range | 126 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 33.26787061 |
|---|---|
| Coefficient of variation (CV) | 0.696576319 |
| Kurtosis | -0.5933703325 |
| Mean | 47.75911799 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.6278080231 |
| Sum | 20598030 |
| Variance | 1106.751215 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81 | 23141 | 5.4% |
| 8 | 20058 | 4.7% |
| 15 | 14530 | 3.4% |
| 73 | 14167 | 3.3% |
| 14 | 14157 | 3.3% |
| 127 | 12083 | 2.8% |
| 108 | 11795 | 2.7% |
| 34 | 11583 | 2.7% |
| 16 | 9651 | 2.2% |
| 61 | 8249 | 1.9% |
| Other values (50) | 291876 |
| Value | Count | Frequency (%) |
| 1 | 6115 | 1.4% |
| 4 | 6647 | 1.5% |
| 5 | 5581 | 1.3% |
| 8 | 20058 | |
| 9 | 5385 | 1.2% |
| 10 | 5860 | 1.4% |
| 11 | 5882 | 1.4% |
| 14 | 14157 | |
| 15 | 14530 | |
| 16 | 9651 |
| Value | Count | Frequency (%) |
| 127 | 12083 | |
| 117 | 5215 | 1.2% |
| 116 | 4740 | 1.1% |
| 108 | 11795 | |
| 99 | 4546 | 1.1% |
| 96 | 6971 | 1.6% |
| 95 | 5519 | 1.3% |
| 91 | 5665 | 1.3% |
| 85 | 5375 | 1.2% |
| 81 | 23141 |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
| Brazil | 23141 |
|---|---|
| Canada | 20058 |
| Australia | 14530 |
| United Arab Emirates | 14167 |
| United Kingdom | 14157 |
| Other values (55) |
Length
| Max length | 20 |
|---|---|
| Median length | 18 |
| Mean length | 7.920786478 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3.416.156 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Israel |
|---|---|
| 2nd row | Israel |
| 3rd row | Israel |
| 4th row | Israel |
| 5th row | Israel |
Common Values
| Value | Count | Frequency (%) |
| Brazil | 23141 | 5.4% |
| Canada | 20058 | 4.7% |
| Australia | 14530 | 3.4% |
| United Arab Emirates | 14167 | 3.3% |
| United Kingdom | 14157 | 3.3% |
| Qatar | 12083 | 2.8% |
| Colombia | 11795 | 2.7% |
| Italy | 11583 | 2.7% |
| Belgium | 9651 | 2.2% |
| Thailand | 8249 | 1.9% |
| Other values (50) | 291876 |
Length
| Value | Count | Frequency (%) |
| united | 34036 | 6.8% |
| brazil | 23141 | 4.6% |
| canada | 20058 | 4.0% |
| australia | 14530 | 2.9% |
| arab | 14167 | 2.8% |
| emirates | 14167 | 2.8% |
| kingdom | 14157 | 2.8% |
| qatar | 12083 | 2.4% |
| colombia | 11795 | 2.4% |
| italy | 11583 | 2.3% |
| Other values (56) | 331483 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.6% |
| n | 258011 | 7.6% |
| e | 246909 | 7.2% |
| r | 223850 | 6.6% |
| l | 178279 | 5.2% |
| t | 173762 | 5.1% |
| o | 171406 | 5.0% |
| d | 136300 | 4.0% |
| u | 113944 | 3.3% |
| Other values (36) | 1110187 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2845046 | |
| Uppercase Letter | 501200 | 14.7% |
| Space Separator | 69910 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | |
| n | 258011 | |
| e | 246909 | |
| r | 223850 | |
| l | 178279 | 6.3% |
| t | 173762 | 6.1% |
| o | 171406 | 6.0% |
| d | 136300 | 4.8% |
| u | 113944 | 4.0% |
| Other values (13) | 539077 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 51581 | 10.3% |
| S | 48218 | 9.6% |
| A | 46438 | 9.3% |
| U | 40098 | 8.0% |
| B | 38720 | 7.7% |
| I | 33806 | 6.7% |
| R | 22518 | 4.5% |
| M | 22192 | 4.4% |
| L | 21239 | 4.2% |
| E | 19754 | 3.9% |
| Other values (12) | 156636 |
Space Separator
| Value | Count | Frequency (%) |
| 69910 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3346246 | |
| Common | 69910 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.8% |
| n | 258011 | 7.7% |
| e | 246909 | 7.4% |
| r | 223850 | 6.7% |
| l | 178279 | 5.3% |
| t | 173762 | 5.2% |
| o | 171406 | 5.1% |
| d | 136300 | 4.1% |
| u | 113944 | 3.4% |
| Other values (35) | 1040277 |
Common
| Value | Count | Frequency (%) |
| 69910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3416156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 508425 | |
| i | 295083 | 8.6% |
| n | 258011 | 7.6% |
| e | 246909 | 7.2% |
| r | 223850 | 6.6% |
| l | 178279 | 5.2% |
| t | 173762 | 5.1% |
| o | 171406 | 5.0% |
| d | 136300 | 4.0% |
| u | 113944 | 3.3% |
| Other values (36) | 1110187 |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92.99102692 |
| Minimum | 80 |
|---|---|
| Maximum | 107 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 80 |
|---|---|
| 5-th percentile | 82 |
| Q1 | 86 |
| median | 95 |
| Q3 | 98 |
| 95-th percentile | 100 |
| Maximum | 107 |
| Range | 27 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.581675722 |
|---|---|
| Coefficient of variation (CV) | 0.07077753564 |
| Kurtosis | -1.059684708 |
| Mean | 92.99102692 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.2670780865 |
| Sum | 40106100 |
| Variance | 43.31845531 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 50932 | |
| 85 | 40694 | 9.4% |
| 98 | 37969 | 8.8% |
| 100 | 37185 | 8.6% |
| 94 | 32015 | 7.4% |
| 97 | 30544 | 7.1% |
| 96 | 28833 | 6.7% |
| 87 | 21735 | 5.0% |
| 89 | 21197 | 4.9% |
| 86 | 19449 | 4.5% |
| Other values (11) | 110737 |
| Value | Count | Frequency (%) |
| 80 | 12083 | 2.8% |
| 82 | 9955 | 2.3% |
| 83 | 11795 | 2.7% |
| 84 | 17036 | |
| 85 | 40694 | |
| 86 | 19449 | |
| 87 | 21735 | |
| 89 | 21197 | |
| 90 | 6062 | 1.4% |
| 91 | 4876 | 1.1% |
| Value | Count | Frequency (%) |
| 107 | 6115 | 1.4% |
| 104 | 12228 | 2.8% |
| 100 | 37185 | |
| 99 | 50932 | |
| 98 | 37969 | |
| 97 | 30544 | |
| 96 | 28833 | |
| 95 | 13802 | 3.2% |
| 94 | 32015 | |
| 93 | 11460 | 2.7% |
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47109.44445 |
| Minimum | 343.353 |
|---|---|
| Maximum | 332915.073 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.3 MiB |
Quantile statistics
| Minimum | 343.353 |
|---|---|
| 5-th percentile | 1325.185 |
| Q1 | 5465.63 |
| median | 11632.326 |
| Q3 | 60367.477 |
| 95-th percentile | 213993.437 |
| Maximum | 332915.073 |
| Range | 332571.72 |
| Interquartile range (IQR) | 54901.847 |
Descriptive statistics
| Standard deviation | 69048.93932 |
|---|---|
| Coefficient of variation (CV) | 1.465713301 |
| Kurtosis | 4.751062076 |
| Mean | 47109.44445 |
| Median Absolute Deviation (MAD) | 9765.384 |
| Skewness | 2.241364252 |
| Sum | 2.03178323 × 1010 |
| Variance | 4767756021 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 213993.437 | 23141 | 5.4% |
| 38067.903 | 20058 | 4.7% |
| 25788.215 | 14530 | 3.4% |
| 9991.089 | 14167 | 3.3% |
| 68207.116 | 14157 | 3.3% |
| 2930.528 | 12083 | 2.8% |
| 51265.844 | 11795 | 2.7% |
| 60367.477 | 11583 | 2.7% |
| 11632.326 | 9651 | 2.2% |
| 69950.85 | 8249 | 1.9% |
| Other values (50) | 291876 |
| Value | Count | Frequency (%) |
| 343.353 | 3371 | 0.8% |
| 442.784 | 3634 | 0.8% |
| 628.053 | 5665 | |
| 634.814 | 5299 | |
| 1325.185 | 5587 | |
| 1866.942 | 4869 | |
| 2078.724 | 6406 | |
| 2689.862 | 6525 | |
| 2872.933 | 5215 | |
| 2930.528 | 12083 |
| Value | Count | Frequency (%) |
| 332915.073 | 5712 | 1.3% |
| 276361.783 | 6513 | 1.5% |
| 213993.437 | 23141 | |
| 145912.025 | 6036 | 1.4% |
| 130262.216 | 7568 | 1.8% |
| 126050.804 | 6647 | 1.5% |
| 98168.833 | 5826 | 1.4% |
| 85042.738 | 5895 | 1.4% |
| 83900.473 | 6504 | 1.5% |
| 69950.85 | 8249 | 1.9% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| year | country | school_id | student_id | mother_educ | father_educ | gender | computer | internet | math | read | science | stu_wgt | desk | room | dishwasher | television | computer_n | car | book | wealth | escs | country_2 | country_name | rank | country_3 | finalIq | pop2021 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2015 | ISR | 37600145 | 37603048 | ISCED 3A | ISCED 3A | female | NA | NA | 524.233 | 517.321 | 517.577 | 16.66115 | yes | NA | NA | NA | NA | NA | more than 500 | NA | 0.30620000000000003 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 1 | 2015 | ISR | 37600145 | 37602416 | ISCED 3A | NA | female | NA | NA | 538.210 | 519.036 | 523.982 | 16.66115 | yes | NA | NA | NA | NA | NA | 201-500 | NA | 1.478 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 2 | 2015 | ISR | 37600145 | 37605312 | NA | NA | female | NA | NA | 589.116 | 621.705 | 607.757 | 16.66115 | NA | NA | NA | NA | NA | NA | more than 500 | NA | NA | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 3 | 2015 | ISR | 37600145 | 37601743 | ISCED 3A | ISCED 3B, C | female | NA | NA | 532.836 | 548.028 | 478.319 | 16.66115 | yes | NA | NA | NA | NA | NA | 101-200 | NA | 0.2243 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 4 | 2015 | ISR | 37600145 | 37603927 | ISCED 3B, C | NA | female | NA | NA | 478.684 | 550.993 | 483.711 | 16.66115 | yes | NA | NA | NA | NA | NA | 11-25 | NA | 0.1467 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 5 | 2015 | ISR | 37600145 | 37601586 | ISCED 3B, C | ISCED 3B, C | female | NA | NA | 574.335 | 598.832 | 558.763 | 16.66115 | yes | NA | NA | NA | NA | NA | 26-100 | NA | 0.7979 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 6 | 2015 | ISR | 37600145 | 37605625 | NA | NA | female | NA | NA | 336.421 | 357.691 | 334.377 | 16.66115 | NA | NA | NA | NA | NA | NA | NA | NA | NA | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 7 | 2015 | ISR | 37600145 | 37605647 | ISCED 3A | ISCED 1 | female | NA | NA | 573.293 | 575.490 | 523.413 | 16.66115 | yes | NA | NA | NA | NA | NA | more than 500 | NA | 1.7141 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 8 | 2015 | ISR | 37600145 | 37604557 | NA | NA | female | NA | NA | 532.381 | 457.606 | 496.095 | 16.66115 | yes | NA | NA | NA | NA | NA | 101-200 | NA | 0.6028 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
| 9 | 2015 | ISR | 37600145 | 37600526 | ISCED 3A | ISCED 3A | female | NA | NA | 514.964 | 587.544 | 522.694 | 16.66115 | yes | NA | NA | NA | NA | NA | 26-100 | NA | 0.1493 | ISR | Israel | 44 | Israel | 94 | 8789.774 |
Last rows
| year | country | school_id | student_id | mother_educ | father_educ | gender | computer | internet | math | read | science | stu_wgt | desk | room | dishwasher | television | computer_n | car | book | wealth | escs | country_2 | country_name | rank | country_3 | finalIq | pop2021 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 431280 | 2015 | GBR | 82650109 | 82652930 | ISCED 3A | ISCED 3A | female | yes | yes | 530.299 | 481.074 | 546.442 | 16.96736 | yes | yes | NA | 3+ | 3+ | 2 | 26-100 | 1.2783 | 1.0714 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431281 | 2015 | GBR | 82650109 | 82650407 | ISCED 3A | ISCED 3B, C | female | yes | yes | 499.617 | 449.433 | 451.770 | 16.96736 | no | yes | NA | 3+ | 1 | 2 | 26-100 | -0.2818 | 0.6488 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431282 | 2015 | GBR | 82650109 | 82653069 | ISCED 3A | ISCED 3A | female | yes | yes | 515.442 | 604.943 | 581.118 | 16.96736 | yes | yes | NA | 3+ | 3+ | 2 | 101-200 | 1.4826 | 1.1976 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431283 | 2015 | GBR | 82650109 | 82650368 | ISCED 2 | ISCED 3A | female | yes | yes | 518.956 | 436.519 | 468.840 | 16.96736 | yes | yes | NA | 3+ | 3+ | 2 | 201-500 | 0.7857 | 1.3324 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431284 | 2015 | GBR | 82650109 | 82654153 | ISCED 3A | ISCED 3A | female | yes | yes | 603.154 | 598.225 | 604.463 | 16.96736 | yes | yes | NA | 1 | 3+ | 1 | more than 500 | -0.1614 | 1.1585 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431285 | 2015 | GBR | 82650109 | 82654468 | ISCED 3A | ISCED 3A | female | yes | yes | 640.392 | 617.868 | 641.736 | 16.96736 | yes | yes | NA | 2 | 2 | 2 | 11-25 | 1.1876 | 0.6643 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431286 | 2015 | GBR | 82650109 | 82653067 | ISCED 3A | ISCED 3A | female | yes | yes | 568.223 | 514.913 | 520.089 | 16.96736 | yes | yes | NA | 3+ | 3+ | 1 | 201-500 | 0.4932 | 0.9365 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431287 | 2015 | GBR | 82650109 | 82651934 | ISCED 2 | ISCED 2 | female | yes | yes | 553.587 | 590.505 | 545.729 | 16.96736 | yes | yes | NA | 3+ | 3+ | 3+ | 11-25 | 2.1047 | -0.4144 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431288 | 2015 | GBR | 82650109 | 82652209 | ISCED 3A | ISCED 3A | female | yes | yes | 536.515 | 500.004 | 503.055 | 16.96736 | yes | yes | NA | 2 | 3+ | 2 | 101-200 | 1.7227 | 1.3572 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |
| 431289 | 2015 | GBR | 82650109 | 82654177 | ISCED 3A | ISCED 3B, C | female | yes | yes | 413.326 | 288.821 | 393.726 | 16.96736 | yes | yes | NA | 2 | 2 | 2 | 26-100 | 0.1245 | 1.1504 | GBR | United Kingdom | 14 | United Kingdom | 99 | 68207.116 |